modeling technique
Generative AI for Strategic Plan Development
Given recent breakthroughs in Generative Artificial Intelligence (GAI) and Large Language Models (LLMs), more and more professional services are being augmented through Artificial Intelligence (AI), which once seemed impossible to automate. This paper presents a modular model for leveraging GAI in developing strategic plans for large scale government organizations and evaluates leading machine learning techniques in their application towards one of the identified modules. Specifically, the performance of BERTopic and Non-negative Matrix Factorization (NMF) are evaluated in their ability to use topic modeling to generate themes representative of Vision Elements within a strategic plan. To accomplish this, BERTopic and NMF models are trained using a large volume of reports from the Government Accountability Office (GAO). The generated topics from each model are then scored for similarity against the Vision Elements of a published strategic plan and the results are compared. Our results show that these techniques are capable of generating themes similar to 100% of the elements being evaluated against. Further, we conclude that BERTopic performs best in this application with more than half of its correlated topics achieving a "medium" or "strong" correlation. A capability of GAI-enabled strategic plan development impacts a multi-billion dollar industry and assists the federal government in overcoming regulatory requirements which are crucial to the public good. Further work will focus on the operationalization of the concept proven in this study as well as viability of the remaining modules in the proposed model for GAI-generated strategic plans.
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.84)
ORLM: Training Large Language Models for Optimization Modeling
Tang, Zhengyang, Huang, Chenyu, Zheng, Xin, Hu, Shixi, Wang, Zizhuo, Ge, Dongdong, Wang, Benyou
Large Language Models (LLMs) have emerged as powerful tools for tackling complex Operations Research (OR) problem by providing the capacity in automating optimization modeling. However, current methodologies heavily rely on prompt engineering (e.g., multi-agent cooperation) with proprietary LLMs, raising data privacy concerns that could be prohibitive in industry applications. To tackle this issue, we propose training open-source LLMs for optimization modeling. We identify four critical requirements for the training dataset of OR LLMs, design and implement OR-Instruct, a semi-automated process for creating synthetic data tailored to specific requirements. We also introduce the IndustryOR benchmark, the first industrial benchmark for testing LLMs on solving real-world OR problems. We apply the data from OR-Instruct to various open-source LLMs of 7b size (termed as ORLMs), resulting in a significantly improved capability for optimization modeling. Our best-performing ORLM achieves state-of-the-art performance on the NL4OPT, MAMO, and IndustryOR benchmarks. Our code and data are available at \url{https://github.com/Cardinal-Operations/ORLM}.
Optimal EEG Electrode Set for Emotion Recognition From Brain Signals: An Empirical Quest
Prodhan, Rumman Ahmed, Akter, Sumya, Pias, Tanmoy Sarkar, Adnan, Md. Akhtaruzzaman
The human brain is a complex organ, still completely undiscovered, that controls almost all the parts of the body. Apart from survival, the human brain stimulates emotions. Recent research indicates that brain signals can be very effective for emotion recognition. However, which parts of the brain exhibit most of the emotions is still under-explored. In this study, we empirically analyze the contribution of each part of the brain in exhibiting emotions. We use the DEAP dataset to find the most optimal electrode set which eventually leads to the effective brain part associated with emotions. We use Fast Fourier Transformation for effective feature extraction and a 1D-CNN with residual connection for classification. Though 32 electrodes from the DEAP dataset got an accuracy of 97.34%, only 12 electrodes (F7, P8, O1, F8, C4, T7, PO3, Fp1, Fp2, O2, P3, and Fz) achieve 95.81% accuracy. This study also shows that adding more than 10 electrodes does not improve performance significantly. Moreover, the frontal lobe is the most important for recognizing emotion.
- Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.05)
- Oceania > Australia > Queensland (0.04)
- North America > United States > Virginia > Montgomery County > Blacksburg (0.04)
- Asia > India > NCT > New Delhi (0.04)
UCTB: An Urban Computing Tool Box for Spatiotemporal Crowd Flow Prediction
Chen, Liyue, Chai, Di, Wang, Leye
Spatiotemporal crowd flow prediction is one of the key technologies in smart cities. Currently, there are two major pain points that plague related research and practitioners. Firstly, crowd flow is related to multiple domain knowledge factors; however, due to the diversity of application scenarios, it is difficult for subsequent work to make reasonable and comprehensive use of domain knowledge. Secondly, with the development of deep learning technology, the implementation of relevant techniques has become increasingly complex; reproducing advanced models has become a time-consuming and increasingly cumbersome task. To address these issues, we design and implement a spatiotemporal crowd flow prediction toolbox called UCTB (Urban Computing Tool Box), which integrates multiple spatiotemporal domain knowledge and state-of-the-art models simultaneously. The relevant code and supporting documents have been open-sourced at https://github.com/uctb/UCTB.
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
- Asia > China > Hong Kong (0.04)
- Asia > China > Beijing > Beijing (0.04)
- (3 more...)
- Transportation > Passenger (0.46)
- Transportation > Ground > Road (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.96)
TorchScale: Transformers at Scale
Ma, Shuming, Wang, Hongyu, Huang, Shaohan, Wang, Wenhui, Chi, Zewen, Dong, Li, Benhaim, Alon, Patra, Barun, Chaudhary, Vishrav, Song, Xia, Wei, Furu
Large Transformers have achieved state-of-the-art performance across many tasks. Most open-source libraries on scaling Transformers focus on improving training or inference with better parallelization. In this work, we present TorchScale, an open-source toolkit that allows researchers and developers to scale up Transformers efficiently and effectively. TorchScale has the implementation of several modeling techniques, which can improve modeling generality and capability, as well as training stability and efficiency. Experimental results on language modeling and neural machine translation demonstrate that TorchScale can successfully scale Transformers to different sizes without tears. The library is available at https://aka.ms/torchscale.
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
A Unified Comparison of User Modeling Techniques for Predicting Data Interaction and Detecting Exploration Bias
Ha, Sunwoo, Monadjemi, Shayan, Garnett, Roman, Ottley, Alvitta
Abstract-- The visual analytics community has proposed several user modeling algorithms to capture and analyze users' interaction behavior in order to assist users in data exploration and insight generation. For example, some can detect exploration biases while others can predict data points that the user will interact with before that interaction occurs. Researchers believe this collection of algorithms can help create more intelligent visual analytics tools. However, the community lacks a rigorous evaluation and comparison of these existing techniques. As a result, there is limited guidance on which method to use and when. Our paper seeks to fill in this missing gap by comparing and ranking eight user modeling algorithms based on their performance on a diverse set of four user study datasets. We analyze exploration bias detection, data interaction prediction, and algorithmic complexity, among other measures. Based on our findings, we highlight open challenges and new directions for analyzing user interactions and visualization provenance. After surveying the body of work, we selected seven proposed Researchers in the visualization community have long viewed interaction techniques and standardized their input and output specifications to as an analytic discourse between the analyst and the visualization account for a variety of datasets. In addition to the selected models, system [40].
- North America > United States > Washington > Benton County > Richland (0.04)
- North America > United States > Virginia > Fairfax County > McLean (0.04)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.88)
Guide To Finding The Right Predictive Maintenance Machine Learning Techniques - KDnuggets
If you think carefully, you'll realize that the world we live in today is dependent heavily on the functioning of machines and systems. Almost everything from a light switch to a smartphone, from an elevator to a car, is a machine or a system that controls a machine. However, any machine is subject to wear and tear. What happens to a life so dependent on machines, when that particular machine breaks down? This is precisely why there's a dire need for predictive maintenance with machine learning.
Exploring Context Modeling Techniques on the Spatiotemporal Crowd Flow Prediction
In the big data and AI era, context is widely exploited as extra information which makes it easier to learn a more complex pattern in machine learning systems. However, most of the existing related studies seldom take context into account. The difficulty lies in the unknown generalization ability of both context and its modeling techniques across different scenarios. To fill the above gaps, we conduct a large-scale analytical and empirical study on the spatiotemporal crowd prediction (STCFP) problem that is a widely-studied and hot research topic. We mainly make three efforts:(i) we develop new taxonomy about both context features and context modeling techniques based on extensive investigations in prevailing STCFP research; (ii) we conduct extensive experiments on seven datasets with hundreds of millions of records to quantitatively evaluate the generalization ability of both distinct context features and context modeling techniques; (iii) we summarize some guidelines for researchers to conveniently utilize context in diverse applications.
- Transportation > Ground > Road (1.00)
- Transportation > Electric Vehicle (0.68)
- Automobiles & Trucks (0.68)
Questions You Should Ask Before Starting Your ML Project
The upside that ML models can learn from data without explicitly defining a wide range of unmanageable instructions opens up the doors for a broad range of applications. On top of that, there is a sentiment that VCs and Angels pounce on "ML-powered" organizations which blurs our vision to actually see through the risks associated with ML projects. Also, what works as an impediment in evaluating a solution is when technical founders view their "interesting" ML approach in a vacuum irrespective of how appropriate that approach is for the product. There is no certain process to predict the success of an ML project but there are guidelines and practices that can help in reducing the associated risk and that is the sweet spot where this article fits in. Think through before dive deep!
Using Machine Learning to Analyze Taylor Swift's Lyrics
For the past few months, the Curriculum team at Codecademy has been hard at work creating Machine Learning courses. While we all loved writing the courses, we also wanted to see what we could do with real-world data. As a result, we challenged each other to find a use for machine learning in a topic that we were passionate about. It's said that popular music is a reflection of society, a barometer for our collective wants, fears, and emotional states. Others are of the belief that music is more a reflection of the artist, a diary that's been flung from the nightstand drawer into the media frenzy of our modern world.
- Media > Music (1.00)
- Leisure & Entertainment (1.00)